SCC

Author

Matteo Bellitti

Last Updated

January 21, 2025

Using SCC

“The Boston University Shared Computing Cluster (SCC) is a heterogeneous Linux cluster […] suitable for high-performance computing.”

To learn how to use SCC, check out the introductory guides on BU’s website.

In practice, you should run your code on SCC if you

Need to access one of the large datasets, and/or
Need a lot of computing power (CPU or GPU)

You should run on the workstations (or your own device)

For quick prototyping
If your analysis is light: working with tabular data only or running a regression analysis is probably fine there.

Connecting to SCC

The most common ways of connecting to SCC are via SSH or using OnDemand.

SSH: gives you a terminal interface to SCC. Remember that you have to go through scc4.bu.edu to access the restricted data partitions. For more info, see the official BU guide.
SCC OnDemand: a friendly web-based interface. It lets you access the data in your home directory and the lab directory, and also lets you run interactive apps (Jupyter, VSCode, MATLAB, etc.) on SCC from your browser. For more info, see the official BU guide. Really, give OnDemand a try, it’s awesome.
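As a minimal sketch of the SSH route, the snippet below assembles the ssh command for the scc4.bu.edu login node mentioned above. The helper name build_ssh_command and the username jdoe are placeholders introduced for illustration; the script only prints the command, so it does not open a connection.

```python
import shlex

# Login node named above as the way in to the restricted data partitions.
RESTRICTED_LOGIN_HOST = "scc4.bu.edu"


def build_ssh_command(username, host=RESTRICTED_LOGIN_HOST):
    """Return the argument list for opening a terminal session on `host`."""
    return ["ssh", f"{username}@{host}"]


if __name__ == "__main__":
    # "jdoe" is a placeholder; replace it with your own SCC username.
    print(shlex.join(build_ssh_command("jdoe")))
```

Paste the printed command into a terminal to connect, or pass the list to subprocess.run if you would rather launch it from Python.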

If you use VSCode, you can also connect to SCC using the “Connect to Host” command, which is very convenient for editing your code. Remember: do NOT use the login nodes for any heavy computation, or your process will be terminated. When you’re ready to run your code, submit it as a batch job, request an interactive shell, or use the interactive apps on SCC OnDemand.

Recommended SCC Workflow

Note

When submitting a job, make sure to specify the project name. The global project name is vkolagrp.
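A rough sketch of what specifying the project can look like, assuming jobs are submitted with qsub and that the project is passed with its -P option, as BU's batch documentation describes. The helper name build_qsub_command and the script name run_analysis.sh are hypothetical; the snippet prints the command rather than submitting anything.

```python
import shlex

# Global project name given in the note above.
PROJECT = "vkolagrp"


def build_qsub_command(script, project=PROJECT, extra_args=()):
    """Return the argument list for submitting `script` under `project`.

    Assumes the scheduler's `qsub` takes the project via `-P`.
    """
    return ["qsub", "-P", project, *extra_args, script]


if __name__ == "__main__":
    # "run_analysis.sh" is a placeholder job script name.
    print(shlex.join(build_qsub_command("run_analysis.sh")))
```

Keeping the project name in one constant means every submission script picks it up automatically, so a job is less likely to be sent without it.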
